MCN: Modulated Convolutional Network


FIGURE 3.5

Accuracy with different K for 20-layer MCNs with width 16-16-32-64 on CIFAR-10.

columns show the accuracies of U-MCNs and MCNs, respectively. The results in the last three columns show that the accuracy of MCNs decreases only slightly when binarized filters are used. Note that with a fixed number of convolutional layers, the performance of MCNs increases with network width, although the number of parameters grows accordingly. Compared with LBCNN, MCNs use far fewer parameters (17.2 M vs. 61 M) yet perform much better (95.30% vs. 92.96%). The last three columns also show that MCNs achieve performance similar to that of U-MCNs and WRNs.
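To make the binarization step concrete, the following is a minimal sketch in the XNOR-Net style, where each filter is reduced to its signs plus a per-filter scale; the mean-absolute-value scaling used here is an assumption for illustration, not necessarily the exact MCN reconstruction.

```python
import torch

def binarize_filters(weights: torch.Tensor) -> torch.Tensor:
    # weights: (out_channels, in_channels, k, k).
    # Per-filter scale alpha = mean absolute value, so alpha * sign(W)
    # is the best {-alpha, +alpha} approximation of W in the L1 sense.
    alpha = weights.abs().mean(dim=(1, 2, 3), keepdim=True)
    return alpha * torch.sign(weights)

# Binarization keeps each weight's sign plus one magnitude per filter,
# which is why accuracy drops only slightly in the table above.
w = torch.randn(16, 16, 3, 3)
w_bin = binarize_filters(w)
print(torch.unique(w_bin[0]).numel())  # each filter takes only 2 values
```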

3.4.5 Model Effect

Learning convergence: The MCN binarization process is implemented on the Torch platform for classification. Training a 20-layer MCN with width 16-16-32-64 for 200 epochs takes about 3 hours on two GTX 1080 Ti GPUs. We plot the training and testing accuracy of MCNs and U-MCNs in Fig. 3.10; the architecture of U-MCNs is the same as that of MCNs. Figure 3.10 clearly shows that MCNs (the blue curves) converge at a speed similar to that of their unbinarized counterpart, U-MCNs (the red curves).
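For readers who want to reproduce curves like those in Fig. 3.10, a minimal PyTorch-style training and evaluation loop is sketched below; the model, the data loaders, and the SGD hyperparameters are illustrative assumptions, not the book's exact recipe.

```python
import torch
import torch.nn as nn

def train(model: nn.Module, train_loader, test_loader, epochs: int = 200):
    # Sketch of a per-epoch train/test loop; hyperparameters are assumptions.
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    model = model.to(device)
    criterion = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1,
                                momentum=0.9, weight_decay=5e-4)
    for epoch in range(epochs):
        model.train()
        for images, labels in train_loader:
            images, labels = images.to(device), labels.to(device)
            optimizer.zero_grad()
            loss = criterion(model(images), labels)
            loss.backward()
            optimizer.step()
        # Record test accuracy each epoch to plot convergence curves.
        model.eval()
        correct = total = 0
        with torch.no_grad():
            for images, labels in test_loader:
                images, labels = images.to(device), labels.to(device)
                correct += (model(images).argmax(1) == labels).sum().item()
                total += labels.size(0)
        print(f"epoch {epoch + 1}: test accuracy {correct / total:.4f}")
```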

Runtime analysis: We performed a runtime analysis to compare MCNs and LBCNN. The runtimes of MCNs and LBCNN over all CIFAR-10 test samples are 8.7 s and 160.6 s, respectively, i.e., MCNs are roughly 18 times faster.
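Such a comparison can be reproduced with a simple timing harness; the sketch below assumes an already-trained network and the standard torchvision CIFAR-10 test split, and the batch size is an arbitrary choice.

```python
import time
import torch
import torchvision
import torchvision.transforms as T

def time_inference(model: torch.nn.Module, batch_size: int = 100) -> float:
    # Measure wall-clock inference time over all 10,000 CIFAR-10 test images.
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    test_set = torchvision.datasets.CIFAR10(
        root="./data", train=False, download=True, transform=T.ToTensor())
    loader = torch.utils.data.DataLoader(test_set, batch_size=batch_size)
    model = model.to(device).eval()
    if device.type == "cuda":
        torch.cuda.synchronize()  # start timing from an idle GPU
    start = time.perf_counter()
    with torch.no_grad():
        for images, _ in loader:
            model(images.to(device))
    if device.type == "cuda":
        torch.cuda.synchronize()  # wait for queued kernels before stopping
    return time.perf_counter() - start  # seconds for the full test set
```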

FIGURE 3.6
Network architectures of CNNs and MCNs. CNN: Input image → Conv 3×3, 80 → R+MP → Conv 3×3, 160 → R+MP → Conv 3×3, 320 → R+MP → Conv 3×3, 640 → R+MP → FC 1024 → D → Output. MCN: Input image → Copy ×4 → BN → MCconv 4×3×3, 20 → R+MP → BN → MCconv 4×3×3, 40 → R+MP → BN → MCconv 4×3×3, 80 → R+MP → BN → MCconv 4×3×3, 160 → R+MP → FC 1024 → D → Output. (MP: max pooling; R: ReLU; BN: batch normalization; D: dropout.)
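Read as a layer recipe, the MCN column of Fig. 3.6 can be written down directly. In the sketch below a plain Conv2d stands in for MCconv (whose modulated-filter arithmetic is defined earlier in the chapter), and "Copy ×4" is interpreted as repeating the input four times along the channel axis, so only the layer layout, not the modulation, is faithful to the figure.

```python
import torch
import torch.nn as nn

class MCNSketch(nn.Module):
    """Layer layout of the MCN column in Fig. 3.6 (Conv2d as MCconv stand-in)."""

    def __init__(self, num_classes: int = 10):
        super().__init__()

        def stage(in_ch, out_ch):
            # One BN -> MCconv -> ReLU -> max-pool stage from the figure.
            return nn.Sequential(
                nn.BatchNorm2d(in_ch),
                nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
                nn.ReLU(inplace=True),
                nn.MaxPool2d(2))

        self.features = nn.Sequential(
            stage(4 * 3, 20),   # input copied 4x along channels: 12 -> 20
            stage(20, 40),
            stage(40, 80),
            stage(80, 160))
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.LazyLinear(1024),  # FC 1024
            nn.Dropout(),         # D
            nn.LazyLinear(num_classes))

    def forward(self, x):
        x = x.repeat(1, 4, 1, 1)  # "Copy x4" read as channel repetition
        return self.classifier(self.features(x))

# Usage: a 32x32 RGB batch passes through four pooling stages (32->2).
out = MCNSketch()(torch.randn(1, 3, 32, 32))
print(out.shape)  # torch.Size([1, 10])
```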